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This article presents a parametric-complexity exact algorithm to solve the graph 3-COLORING problem. Its 
maximal complexity is controlled by the parameter a G N which determines the running-time: 0{nf^). 
The algorithm relies on the efficient search of "3-uncolorability witnesses" which, as it is shown here, is 
in NPnCoNPnP for any fixed a, and in general for a ^ /(input). The algorithm always returns either a 
legal 3-coloring, a 3-uncolorability witness or alternatively "undetermined". Hence, it is not compulsory to 
trust neither the correctness of the algorithm itself nor the particular implementation used in order to be 
convinced that the obtained solution is correct. In the "undetermined" case, it can be re-run with a higher a 
until a solution is found, and so, a unique number o(G) S N corresponds to each graph G. Here, it is proved, 
for all k 6 N, that the proportion of graphs such that a(G) = k is greater than or equal to the proportion 
for a(G) > k, hence the probability of a{G) > k has an upper-bound of P(a(G) > k) < l/2 k+1 and thus 
lim^oo P(a(G) < k) = 1. As a corollary, it is shown that asa^oo (NP U CoNP) \ P -> and that P ^ NP 
implies a{G) 6 [0, oo). Hence, 3-colorability restricted to a(G) < k is in P. Nevertheless, it remains unknown 
whether it is NP-complete for some finite k. Experimental analyses have been thoroughly performed. The 
algorithm has been tested on planar graphs, 4-regular planar graphs and Erdos-Renyi random graphs. The 
experimental results validate the algorithmic implementation and confirm the theoretical expected results: 
for planar graphs, it was found that a = 1 sufficed to obtain a solution in all the tests, while for random 
graphs, it has been observed that the distribution of a(G) obeys the given theoretical bounds. 
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1. INTRODUCTION 

Graph Coloring is one of the oldest and popular Constraint Satisfaction Problems 
(CSPs) [Jones 2008]. The study of efficient CSPs solving algorithms, and specifically 
for the graph coloring problem, is a central topic in Computer Science and Artificial 
Intelligence due to its wide applicability in many engineering projects, e.g., VLSI test- 
ing, planning and scheduling, timetabling, satellite range scheduling, register alloca- 
tion, printed circuit testing and frequency assignment [Wigderson 1983; Park and Lee 
1996; Ramani et al. 2004], as well as theoretical models, e.g., spin-glasses and the 
anti-ferromagnetic Potts model [Zdeborova and Krzakala 2007]. 

The problem of deciding if a given graph can be colored with k or less colors is called 
the fc-colorability problem. For k = 2, the colorability problem can be solved efficiently 
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However, for k > 3, in general, there is no known efficient algorithm and the problem 
is NP-complete 1 [Cook 1971; Karp 1972; Levin 1973; Garey and Johnson 1979]. 

Direct methods are just one interesting type among all the diverse methods used to 
approach NP-complete problems. They look for the determination of simple properties 
(e.g. triangle freeness) that impose necessary and sufficient conditions for determining 
the class of an instance (Yes or No). For the three-color-problem [Steinberg 1993] two 
examples are: phase-transition studies [Hogg et al. 1996; Culberson and Gent 2001; 
Mulet et al. 2002; Boettcher and Percus 2004; Zdeborova and Krzakala 2007], based on 
random graphs theory and sharp-thresholds [Erdos and Renyi 1960; Erdos et al. 1976], 
and pure structural-combinatorial approaches, based on, for example, the existence or 
absence of particular cycle configurations (see Steinberg 1993; Borodin 1996; Borodin 
et al. 2005; Wang and Chen 2007; Borodin et al. 2009). One of the foundational results 
along this approach is the Grotzsch's 3-color theorem [Thomassen 1994]: triangle-free 
planar graphs are 3-colorable. 

On the other hand, given the intractability of the NP-complete problems, research on 
approximation algorithms started early, e.g., Johnson [1974a; 1974b], Garey and John- 
son [1976], see also Wigderson [1983] and Berger and Rompel [1990]. The best known 
approximation guarantees for general 3-coloring was due to Halldorsson [1993], soon 
improved by Blum [1994] and more recently by Arora and Chlamtac [2006]. However, 
even for the approximate case graph coloring remains hard [Karger et al. 1998]. More 
strictly, it is NP-hard to even find a 4-coloring of a 3-chromatic graph [Khanna et al. 
2000]. 

An alternative approach of more recent development, to the classical worst-case com- 
putational complexity theory, is Parameterized Complexity [Downey and Fellows 1999; 
Flum and Grohe 2006]. In Parameterized Complexity, apart from the problem instance 
itself, there is a parameter (usually an integer) that may be associated arbitrarily with 
the problem instance, allowing to study a problem's complexity with respect to both the 
size of the input (as in classical computational complexity) and the provided parame- 
ter. One of the main virtues of the parametric complexity approach is the concept of 
fixed-parameter tractability. 

In this article a parametric-complexity exact algorithm, A(G,a), to solve the graph 
3-coloring problem is presented. Two key-differences with respect to past graph color- 
ing approaches are that it is (1) exact and (2) polynomial -time (however parametric) 
while previous published algorithms are either low-exponential exact or polynomial- 
time (but approximate). 

The algorithm introduced here relies on the search of 3-uncolorability witnesses. The 
definition of 3-uncolorability witness presented here is (to the best of my knowledge) 
the first one that is formally defined and the most naturally related to the 3-coloring 
problem. A 3-uncolorability witness is defined 2 as a sequence of "forced vertex contrac- 
tions" leading to a graph containing a K±. A verifier can check that every contraction 
is "forced" efficiently. The possibility of efficiently generating 3-uncolorablity witnesses 
is of theoretical interest since 3-colorability is NP-complete and thus 3-uncolorablity 
is CoNP-complete. 

Some works have studied short proofs systems for CoNP-complete problems (e.g. 
Boppana et al. 1987; Fortnow and Sipser 1988). In particular, for graph coloring, there 
is a recent work [Bes and Jegou 2005] that looks for proving graph uncolorability with 



lr The computational complexity terminology and concepts (e.g. P, NP, CoNP, NP-completeness, NP-Hardness, 
certificate/witness, reductions) can be consulted in the recent books of Arora and Barak [2009] and Goldreich 
[2008], 

2 A 3-uncolorability witness can also be initially understood as a (pruned) Zykov-tree (see Corneil and Gra- 
ham 1973) in which every leaf is a graph containing a A'4 subgraph. 
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consistency checks of constraint satisfaction problems. However, short 3-uncolorability 
witnesses are not possible in general unless NP = CoNP, which is unknown and indeed 
unlikely. 

The proposed A(G, a) algorithm has several "good" key-features: 

(1) The algorithm's complexity is controlled by means of a simple parameter (a, the 
maximum recursion level). The parameter determines the order of the polynomial 
that bounds the running-time of the algorithm: 0(n^ a ^). 

(2) The algorithm generates witnesses for both Yes and No instances, i.e., either a 
legal 3-coloring or a 3-uncolorability witness. 

(3) For any fixed a, and in general for a ^ /(input), finding a 3-uncolorability witness 
or a legal 3-coloring is in: NP n CoNP n P. 

(4) If for a given a the algorithm is unable to find a witness then it returns "undeter- 
mined" so that it can be re-run with a higher a up to a solution is obtained, thus, 
to each input graph G, the algorithm assigns a unique a, i.e., a = f(G). 

(5) The possibility of self-tuning the a parameter (by using the undetermined return 
value) gives the algorithm the capability of using only the required computational 
resources for a particular problem instance, e.g., for almost all planar graphs it 
is enough a value of a = to obtain the right solution, obtaining thus an 0(n 2 ) 
algorithm for the 3-coloring problem in almost all planar graphs. 

(6) Also, apart from computational complexity results, a very important feature of the 
algorithm is that it gives stand-alone indubitable results and therefore it is not 
necessary to trust the correctness of the algorithm itself, nor the particular imple- 
mentation used, to recognize that the provided solution is correct, since the veri- 
fication of the result can be done efficiently using just only the solution provided 
(the witness or certificate). 

Moreover, from the theoretical point of view, the most important property of the al- 
gorithm is the classification of all random graphs by the number a(G): the minimum 
value of the parameter a, for which the algorithm is able to obtain a witness, given a 
particular Yes/No instance G of the 3-coloring problem. This brings important conse- 
quences and allows the development a more thorough analysis: 

THEOREM 1.1. Let G be a random graph, ifP(a = k) is the probability that a(G) = 
k and P(a > k) is the probability that a(G) > k, for all k e N, then: 

P(a = k) > P(a > k) (1) 

COROLLARY 1.2. If G is a random graph and P(a > k) is the probability that 
a(G) > k, for all k e N, then: 

lim P(a <k)= 1 (3) 

k— >oo 

COROLLARY 1.3. if a is the parameter of the A(G, a) algorithm then: 

asa^co (NP U CoNP) \ P -> (4) 

Finally: 

LEMMA 1.4. IfP ^ NP then a(G) e [0, oo), 

which means that: if a(G) is finite, i.e., a(G) e [0, n] for some n e N, then P = NP. 

Even assuming the infiniteness of a(G), the problem now is not whether a polyno- 
mial algorithm exists or not, but instead, to determine whether 3-colorability for a 
class of graphs with a(G) < n is NP-complete. 
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1.1. Experimental analysis 

The problem of evaluating algorithms experimentally could be very tricky if tests are 
performed on "artificial instances", which may be uncorrelated or isolated from any 
specific practical application. Nevertheless, there are some lines of research suggest- 
ing special distributions of graph instances on which purported NP-complete problem 
solvers should be evaluated to determine properly their performances (see Selman 
et al. 1996; Culberson and Gent 2001; Mizuno and Nishihara 2008). Moreover, apart 
from the "harder instances approach", Johnson [2002] has proposed a more complete 
methodological approach to the experimental analysis of algorithms. 

In the experimental part of this article, the algorithm is thoroughly evaluated over 
significant samples pertaining to three different graph distributions: 

(1) Pseudo-random planar graphs [Denise et al. 1996; Bodirsky et al. 2003]. 

(2) Random 4-regular planar graphs [Manca 1979; Lehel 1981; Broersma et al. 1993]. 

(3) Erdos-Renyi connected random graphs [Erdos and Renyi I960]. 

Each class, and each distribution, has a good justification: 

(1) Planarity imposes some interesting structural properties, i.e., the three color 
problem on planar graphs is the only unqualified problem that remains open [Stein- 
berg 1993] since 1-coloring is trivial, 2-coloring is well characterized and the maximum 
chromatic number on the plane is four [Appel et al. 1977a; Appel et al. 1977b], and at 
the same time determining 3-colorability of planar graphs is NP-complete [Stockmeyer 
1973; Garey and Johnson 1979]. 

(2) Even more, 3-colorability of four-regular planar graphs still remains NP- 
complete [Dailey 1980] and most important, in this class the average degree is fixed so 
the phase-transition phenomenon as defined for random graphs could not be applied 
directly in this case. 

(3) Finally, sampling from the Erdos-Renyi (connected) random graphs distribution 
gives the necessary theoretical support for evaluating an algorithm in the general case, 
validating the theoretical bounds and allowing to obtain results that could be com- 
pared against other algorithms in the literature, e.g., the best performing 3-coloring 
algorithms proposed in the literature are surveyed by Malaguti and Toth [2010]. 

An interesting experimental finding is that in all the test cases for planar graphs, 
it was found that fixing the maximum recursion level to a = 1 was enough to obtain 
a solution, that is, exact efficiently verifiable results were obtained with a polynomial 
algorithm. This was also the case for 4-regular planar graphs. 

Furthermore, in the general 3-coloring case (random graphs) experiments, it has 
been observed that the distribution of a(G) conforms to the theoretical decreasing pat- 
tern: the majority of graphs are in a(G) = or a(G) = 1, some in a(G) — 2, a few in 
a(G) — 3, very few in a(G) = 4 and so on. However, it was not possible to obtain a 
graph with a(G) > 4 in the sampled random graphs. 

As observed in the experiments, the value of a(G) is directly correlated with the av- 
erage degree d = 2m/n (m = edges, n = vertices), so near the phase transition threshold 
{d*) the probability of a higher value of a(G) increases. 



However, many interesting questions remain open, e.g.: 

— What is the exact distribution of a(G) in random graphs? 

— Apart from the average degree, what other parameters are related to a(G)? 



d* 



{ 



4.69, Mulet et al. [2002]; 

4.703, Boettcher and Percus [2004]; 



(5) 
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A:K 4 B:K 112 C: T 31 D: P 2 E: C 4 



Fig. 1: (A) the complete graph K4. (B) a complete 3-partite graph Ku2- (C) a tadpole 
graph T 3 i which imposes a binary constraint on 3-coloring: either G/xw or G/yw. (D) 
the path of length two P 2 . (E) a C 4 graph that imposes a binary constraint on 3-coloring: 
either G/xz or G/yw. 



— Given an arbitrary input graph G, can it be predicted (exactly or approximately) the 
a(G) value and what is the best possible approximation to a(G)7 

— As for the chromatic number x(G), is there any graph construction mechanism that 
allows the generation of graphs with arbitrarily large a(G)? 

1.2. Preliminary definitions, terminology and notation 

This article follows the standard graph theory terminology. For general terms and 
notation the book of Jensen and Toft [1995] and the recent book, on chromatic graph 
theory, by Chartrand and Zhang [2008] should be consulted. However, some special 
terms and specific notations are defined next. 

Unless we state it otherwise, all graphs in this work are connected and simple (finite, 
and have no loops or parallel edges). A graph is called planar if it can be drawn in a 
plane in such a way that its edges intersect only in its end-points, i.e., there are no 
edges crossing. 

Werefer to u, v as a planar preserving edge if uv is an edge of the complement of G, 
uv e G, and G + uv remains planar. 

Sets are specified using {a, b, c} notation and H c G or H C G reads H is a proper 
subgraph of G or a subgraph of G respectively. A vertex contraction, also called ver- 
tex identification or vertex merging, is denoted by G/uv. Vertex or edge addition and 
deletion are denoted as: G + u or G + uv and G — u or G — uv respectively. 

The special named graphs used in the article are: the complete graph K it the com- 
plete 3-partite graph Km (a K± minus one vertex), the tadpole graph T31 obtained by 
joining a triangle T 3 to a single vertex with a bridge, the triangle graph T 3 , consisting 
of three fully connected vertices, e.g., {a;, y, z} such that xy, yz, xz are edges, the path 
graph P of length two P 2 and the square graph C4 which is a cycle of length four. 

2. DEFINITIONS AND THEORY 

The algorithm introduced here relies heavily on Grotzsch's theorem since the key pro- 
cedures are based on triangle (or triangle configurations) processing to search for a 
3-uncolorability witness. 

A witness (or certificate, Arora and Barak [2009]) is an efficiently verifiable proof of 
the correctness of an answer for some given decision problem. For instance, given a 
graph G, a legal 3-coloring of G or a short-proof of the uncolorability of G with only 
three colors are witnesses for the 3-colorability problem. 
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Definition 2,1. Given a graph G and a 3-colorable subgraph H of G, there is a forced 
vertex contraction 3 on {u, u} e V^(G) if adding the edge uv to H makes H not 3-colorable. 

Definition 2.2. Given a non-3-colorable input graph G, a 3-uncolorability witness 
W is a description of a (possibly empty) sequence of vertex contractions, G/uv, leading 
to a graph containing K^, such that either: 

(1) u, v are the non-complete vertices of a complete 3-partite K 112 subgraph of G or; 

(2) a nested 3-uncolorability witness for the graph G + uv is provided. 

THEOREM 2.3. Finding a 3-uncolorability witness is in: NP n CoNP n P; for any 
fixed recursion level a = k 6 N. 

Proof. 

(1) The problem is in NP: 

— Certificate: the 3-uncolorability witness (W) itself. 

— Polynomial Certifier: check if each element of W is a forced vertex contraction 
and perform it up to obtaining a K t subgraph. If the recursion level a is constant 
then the size of the witness is bounded by a polynomial: 

For a recursion level of a = 0, meaning that W contains only ver- 
tex contractions based on Kn 2 subgraphs and no other nested wit- 
nesses, the size of W is bounded by the maximum number of edges of 
G which is of order 0(n 2 ). Assuming that, for every constant a = k, 
it is bounded by a polynomial P k then we can specify the order of a 
recursion level of a = k + 1 as P k x 0(n 2 ), which is also a polynomial 
on n = \V(G)\, since every nested witness is of order 0(n 2 ). 

(2) Moreover, 3-uncolorability is the complementary problem of graph 3-colorability, 
an NP-complete problem. Hence 3-uncolorability is CoNP-complete. 

(3) That it is also in P for a fixed a follows from the existence of a polynomial algorithm 
to solve it. 

□ 

Now, let us define a greedy 3-coloring algorithm that will serve as a baseline from 
where to derive the proposed coloring algorithm. 

Definition 2.4. The ,g 3 (G) algorithm is a "greedy" 3-coloring algorithm that sequen- 
tially, and at each step, selects two non-adjacent vertices x and y of a graph G and 
contracts them to obtain the graph G/xy, maintaining a list S of the vertices that have 
been contracted so far, so that if in S are just only three independent sets then these 
are three color classes of G and hence a legal 3-coloring of G. 

The proposed parametric-complexity exact algorithm will use the same approach to 
3-coloring as the following lemma below (lemma 2.5): 

LEMMA 2.5. Given an exact algorithm W(G) of complexity 0(n k ) to obtain a 3- 
uncolor ability witness for any non- 3 -colorable graph then there is an exact algorithm 
W(G)i of complexity 0(n k+1 ) to obtain a 3-coloring of any 3-colorable graph. 



3 Two vertices of a graph are said to be 3-chromatically connected if they are assigned the same color in any 
3-coloring of the graph [Steinberg 1993]. Culberson and Gent [2001] use instead the term "frozen pair" and 
define the spine of a colorable graph G as: B(G) = {{u, v} : {u, v} is frozen by G}, as we can see, there is a 
direct relation between B(G) and the definition of the 3-uncolorability witness given here. The same concept 
for such a pair {u, v} is also defined as an implicit-identity by the current author who gives [Martin H. 2011] 
a more thorough study of this subject for the general case (k-chromatic and non-planar graphs). 
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Proof. Assume W(G) exists. 
Then, given a 3-colorable graph G, apply the greedy g 3 (G) algorithm but avoiding the 
contraction of every {x, y} such that G/xy is not 3-colorable, which can be determined 
in 0(n k ) by W(G/xy) . Since G is 3-colorable this will converge to a triangle graph. 
Since 33(G) is of complexity 0(n) (at every step at least one vertex gets colored) then 
we obtain a 3-coloring of G in 0{n k+l ) = 0(n).0(n k ) □ 

COROLLARY 2.6. So, if W(G, a) is an exact parametric algorithm, of complexity 
(9( n /( Q )) ; i obtain a 3 -uncolor ability witness for any non-3 -colorable graph then there is 
an exact parametric algorithm W(G, a)i of complexity 0{n^ k+1 ^) to obtain a 3-coloring 
of any 3-colorable graph. 

To formalize the analysis of the algorithm let us define the following two algorithm 
specifications: 

Definition 2.7. The A(G,a) is a parametric-complexity algorithm that computes a 
function that assign to a given input graph G just one of three possible values: 0, 1 or 
00, when G is respectively: non-3-colorable, 3-colorable or the algorithm was unable to 
find a solution for the given value of the a parameter: 

{0, No-instance: G is not 3-colorable; 
1, Yes-instance: G is 3-colorable; (6) 
00, undetermined for the given a 

Definition 2.8. The W(G,a) is a parametric-complexity algorithm that computes a 
function that assigns to a given input graph G just one of three possible values: a 3- 
uncolorability witness, a legal 3-coloring or a null value, when A(G, a) is respectively: 
0, 1 or 00. 

{a 3 -uncolor ability witness, A{G, a) = 0; 
a legal 3-coloring, A(G, a) = 1; (7) 

0, A(G,a) = 00 

Definition 2.9. Given a graph G, the a level of G: a(G), is the minimum a required 
to obtain a witness for a particular instance of the 3-(un)colorability problem: 

a(G) = min(a) s.t. W(G, a) ^ (8) 
2.1. Proof of the main theorem and corollaries 

Let us define the number N(4>(a)) as the cardinality of the set of all graphs that satisfy 
some logical function, 4>(a), of a, i.e.,: 

N(<f>(a)) = \{G e G* | 0(a)}| , (9) 

where G* is the set of all graphs. 

The key argument of the proof uses the assertion of lemma 2.5 that it suffices to 
be able to solve 3-uncolorability for some a in order to be able to solve 3 -color ability 
for the same a. To prove the theorem the following two auxiliary lemmas need to be 
proved first: 

LEMMA 2.10. Almost all graphs have a(G) = 0. 

Proof. As proved by Erdos et al. [1976] and improved by Kolaitis et al. [1987], 
almost all i^£ + i-free graphs are ^-colorable and hence almost all Kj-free graphs are 
3-colorable, i.e., if G is a K^-free random graph then with high probability it is 3- 
colorable. Thus, if we assume that G is not 3-colorable then with high probability it 
is not K^-free, that is, almost all non-3-colorable graphs contain a K±. So, almost all 
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non-3-colorable graphs have a(G) = 0. Then, by lemma 2.5, for almost all 3-colorable 
graphs a legal 3-coloring can be found also with a = and thus almost all 3-colorable 
graphs also have a(G) — 0. Therefore almost all graphs have a(G) = 0. □ 

LEMMA 2.11. If for some k € N almost all graphs have a(G) = k and almost all 
graphs with a(G) > k can be transformed into a graph with a(G) — k, by applying 
a series of random edge additions or vertex contractions, then almost all graphs with 
a(G) > k have a(G) = k + 1. 

PROOF. Before starting the proof, it is useful to show first that a(G) > fc, for all 
fc € N, is a monotone property 4 (see Alon et al. 2010). 

The key-idea of the proof is that, with high probability, by the application of a min- 
imal series of random edge additions and vertex contractions, almost all graphs with 
a(G) = k + 1 are transformed into a graph with a(G) = fc. And thus show by induction 
that: 

(1) almost all graphs with a(G) > have a{G) = 1 

(2) almost all graphs with a(G) > k have a[G) = k + 1 

(3) almost all graphs with a(G) > fc + 1 have a(G) = k + 2 

The proof is by induction on fc, starting at k = 0. Almost all graphs have a(G) = 0. 
The class of graphs with a(G) — is the class of if 4 -free graphs, even-wheels-free 
graphs and in general every witness that collapses to a K\ only by the identifications 
of Ki 12 subgraphs. Since the number of graph transformation operations on a graph 
G in order to obtain a graph H with a(H) = is a monotonically increasing function 
of a(G) then almost all graphs with a(G) > have a(G) — 1 since the probability of 
transforming a graph with a(G) = 1 into a graph with a(H) = 0, with fewer operations 
than for a(G) > 1, is bigger than for graphs with a(G) > 1. 

So, assume it holds for every fc, that is, almost all graphs with a(G) > k have a(G) = 
k + l, and show that then it holds for fc + 1. Let us suppose that almost all graphs with 
a(G) > k + 1 have a(G) > k + 2. Then, among all the graphs with a(G) > k + 2 the 
probability of obtaining a graph with a = k is bigger that the probability of obtaining 
a graph with a(G) = k for graphs with a(G) = k + 1, which contradicts the inductive 
steps since almost all graphs with a(G) > k have a(G) = k + 1. □ 

Hence, by the lemma 2.11, the theorem 1.1 holds for all k since: 

(1) By the lemma 2.10: N(a = 0) > N(a > 0). 

(2) For all fc, almost all graphs with a(G) > k have a(G) = k + l implies: 

N(a = fc) > N(a > fc), 

(3) Therefore, it follows for random graphs that: 

P(a= fc) > P(a> fc), Vfce[0,oo) 

This completes the proof. □ 

Statements of Corollary 1.2 follows immediately from N(a — fc) > N(a > fc). Corol- 
lary 1.3 is proved next: 

Proof, of Corollary 1.3 

Let NP(a) and CoNP(a) be respectively: 

(1) the set of all problem instances in NP which are in P for a given a. 

(2) the set of all problem instances in CoNP which are in P for a given a. 



4 A monotone property of a graph G is a property that holds for G and also for every arbitrary subgraph of 
G, e.g., K e -free graphs. 
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NP(a) ={£eNP:VGe£; A(G, a) ^ oo}; (10) 
CoNP(a) = {Ee CoNP :VGeE; A(G, a) ^ oo}, (11) 

where E is the class of 3-coloring instances (G) which are solvable in polynomial-time 
by the algorithm A(G, a) for some particular value of a. 

Then, since 3-(un)colorability witness is in NP n CoNP n P, for any fixed a, then 
NP(a) = CoNP(a) and so (NP(a) U CoNP(a)) \ P = 0. 

It is clear then that as a -> oo NP(a) -> NP and CoNP(a) -> CoNP, hence: as a -> oo 
(NP(a)) U CoNP(a)) \ P -> (NP U CoNP) \ P and hence (NP U CoNP) \ P -> 

This completes the proof. □ 

Finally, a proof of lemma 1.4 is direct since a finite a(G) implies P = NP. 
3. DESCRIPTION OF THE ALGORITHM 

The algorithm is divided in two parts: the decision problem (is_3colorable) and the 
coloring algorithms (_3CDL). There are two versions of the coloring algorithms: one 
for planar graphs (planar_3CDL) and one for non-planar graphs (general_3CDL). This 
last division is oriented to take advantage of some special structural constraints of 
planar graphs (e.g. Grotzsch's like theorems) that aid the development of more efficient 
algorithms. First, the algorithm for the planar graphs case is described, which is better 
for understanding the key-idea behind the algorithms, then it is generalized to the 
non-planar graphs case. 

3.1. The decision routine 

In simple terms, the decision algorithm consists in finding (or building) a 3- 
uncolorability witness, subject to a given a level. 

The definition (2.2) of uncolorability witness involves a recurrent term closely re- 
lated to the Zykov [1952] theorem: 

x(G)=mm( X {G/uv),x(G + uv)), (12) 

defined for the pair {u, v} of non-adjacent vertices. 

Hence, we can determine 3-colorability by searching along the tree defined by the 
following recurrence relation: 

X{G) < 3 min { X {G/uv), X (G + uv)) < 3 (13) 

In the proposed algorithm the height of the tree is bounded by a and the selection of 
the uv's follows special priority rules, i.e., contract first every {u, v} belonging to every 
if 112 subgraph. These priority rules will define the search tree and so its size (number 
of nodes). Therefore, the parametric algorithm follows the breadth-first search strat- 
egy. 

Although the size (but not the tree itself) of every Zykov-tree of a given graph G is 
the same, independently of the selected way of branching, the situation is different 
for partial (pruned) Zycov-trees [McDiarmid 1979]. For instance, the use of Ki 12 sub- 
graphs, which are the special subgraphs that involves the essence of the uncolorability 
witnesses, reduce an exponential growing subtree into a linear one by eliminating the 
G + uv branch, allowing to obtain smaller witnesses. 

Hence, given a graph G and a = 0: 

(1) The algorithm will find every ifm subgraph of G and contract its non-complete vertices. 

(2) If in doing this a K4 subgraph is produced then there is a 3-uncolorability witness. 

(3) If in doing this a A'3 subgraph is produced then there is a legal 3-coloring. 

(4) If all K112 subgraphs have been processed so that G does not contain neither a K112 nor a 
K4 subgraph then return "undetermined". 
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The algorithm described above is the base case (a = 0) of the decision algorithm. 
Nevertheless, it is not exact. The next step is to define it for every a to obtain an exact 
algorithm. 

Given a value of a = k, the method consist in processing every T 3 i tadpole subgraph 
(x, y, x, w) of G in the following way: 

(1) Test base case a — and return any possible 3-uncolorability witness. 

(2) Select an unvisited T31 tadpole subgraph (a;, y, x, w) of G. 

(3) Test the vertex contraction G/xw and apply the test for a = k — 1. 

(4) If the algorithm obtains a 3-uncolorability witness for G/xw then working with the graph 
G + xw is mandatory and so {y, w} is a forced vertex contraction based on a nested 3- 
uncolorability witness, since G + yw will lead to a K4. 

(5) If in doing this process a K4 subgraph is produced then return and the 3-uncolorability 
witness. 

(6) If all T 3 i tadpole subgraphs have been visited and no contraction has occurred then return 
00 (undetermined). Otherwise continue at step (1) with the current graph. 

In the reference implementation, the main routine of the coloring algorithm has 
been named: planar _3C0L while the decision routine is named is_3colorable (cf. Algo- 
rithm 1). 

The decision routine is_3colorable returns one of three possible results: 



Algorithm 1 is_3colorable(G, a) 

1: N <- \V(G)\ + 1 

2: while \V(G)\ < N do 
3: N<-\V(G)\ 

4: if not Q, G -s— solveJsTi^G) return 0, G 

5: if G is a triangle return 1, G 

6: if a > then G <- solve _T 31 (G, a) 

7: end while 

8: return 00, 



The routine implementation has two subroutines: solve_ifii 2 and solve_T31. The 
solve_i^n2 routine applies a vertex contraction for every Kn 2 subgraph and detects 
wether the graph contains a K4 until no other Km is found. Pseudo-code of the routine 
is shown in Algorithm 2. 

The solve_T31 comprises a more sophisticated test. The solve_T31 routine (cf. Al- 
gorithm 3) operates on T31 tadpole graphs (Fig. IB). The key idea behind the test is 
that T 3 i subgraphs impose only binary constraints, i.e., either {x,w} or {y,w} must 
be contracted since T31 + xw + yw is a if 4 (the same occurs with square graphs but 



Algorithm 2 solve K 112(G) 
l: for all {x, y, z, w} — Kn 2 do 

2: G <— G/zw {contract the non-complete vertices {z, w} } 

3: if K 4 C G return 0, G 

4: end for 

5: return 2, G 



is_3colorable 



( 



(0, Wo), a 3-uncolorability witness Wo was found; 
(1, Wi), a 3-coloring Wi was found; 
(00, 0) , undetermined for the given a 



(14) 
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Algorithm 3 solve_T 3 i(G, a) 

1: for all {x, y, z, w} = T 3 i do 

2: if not is_3colorable(G/xw, a — 1) then 

3: G G/yw {contraction G/xw failed hence contract G/yw} 

4: return G 

5: end if 

6: end for 

7: return G 



these are not used in the current algorithm). Thus the algorithm proceeds by contract- 
ing G/yw whenever G/xw is not 3-colorable. The solve_T31 routine also makes a call 
to is_3colorable when the current recursive deep is less than the maximum recursive 
level a. 

3.2. The coloring routine (planar graphs) 

The coloring routine, planar_3CDL, uses the is_3colorable algorithm and lemma 2.5 
to try to obtain a legal 3-coloring. 

The main procedure of the algorithm consist in reducing G to a planar triangulation 
by means of the addition/contraction of the planar preserving edges of G (we have 
assumed that the graph is planar). 

A key-idea is to consider the return value (oo, 0) of the is_3colorable as that "there 
is no evidence that the graph is not 3-colorable" so assume it is 3-colorable. To reduce 
G to a planar triangulation, there are basically two equivalent options given a list of 
planar preserving edges p(E): 

(1) The first option is to apply sequentially a vertex contraction to each element {u, v} 
of p(E) and verify 3-colorability of G/uv, if G/uv is not 3-colorable then revert the 
vertex contraction and add the edge uv to G. 

(2) The second option is just to add edges first and reverting them to a vertex contrac- 
tion. 

Both options are equivalent, but the firth one (vertex contraction) reduces the order 
(number of vertices) of G and this should improve the performance of the algorithm 
which is better when G/uv is expected to remain 3-colorable as the current graph. 

At the end, if the triangulation has all degrees even then it is 3-colorable [Heawood 
1898; Jensen and Toft 1995] and finding a legal 3-coloring is immediate. Otherwise, 
the algorithm returns "undetermined", meaning that a was not enough to obtain a 
witness for the input graph. A pseudo-code is shown in the planar_3CDL routine (cf. 
Algorithm 4). 

Finally, as a way to speedup the search for a legal 3-coloring, a greedy-trial color- 
ing algorithm is designed (try_greedy). The key idea is to try a greedy coloring from 
scratch re-starting the algorithm at each triangle when no legal 3-coloring is found 
until all triangles have been tried. For each triangle, the algorithm selects vertices se- 
quentially and contracts them with the first non-adjacent vertex of the triangle. Thus, 
if the result is a triangle then a legal 3-coloring is found. This routine is shown in 
Algorithm 5. 

3.3. Generalizing to the non-planar case 

Although planar graph 3-coloring is NP-complete, reducing a non-planar graph to a 
planar one preserving 3-colorability usually produce a planar graph two orders of mag- 
nitude bigger than the original one and this may signify that for practical applications, 
the resulting planar graphs could be intractable from a practical application perspec- 
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Algorithm 4 planar _3C0L(G, a) 

1: Q, H <— is_3colorable(i/, a) 

2: if Q ^ oo return Q, H 

3: else if try .greedy (H) return 1, H 

4: while G is not a planar triangulation do 

5: ii, v i — a valid planar preserving edge 

6: Hi- G/uv 

7: Q,H <h- is_3colorable(i?, a) 

8: if Q = 1 return 1,H 

9: else if Q = then G <- G + m; 
10: else if try_greedy(i/) return 1, 
li: else G -s— if {undetermined case, assume 3-colorable} 
12: end while 

13: if triangulation G has an odd vertex return oo 
14: return 1,G 



Algorithm 5 try_greedy(G) 

l: for all triangles {x, y,z} e G do 
2: H^G 

3: for all w eV{G)\{x,y,z} do 

4: if {x, w} i E(G) then H if/xio 

5: else if {y, w} ^ B(G) then H <- 

6: else if {z, w} ^ E{G) then H iJ/zw 

7: else break 

8: end for 

9: if G is a triangle return 1, H 
10: end for 
11: return 0, G 



tive. Hence, it looks imperative to find the way to generalize the algorithm to operate 
directly on non-planar graphs. 

A natural way of generalizing the algorithm, without finding new structural insights 
or heuristics, is to increase the a level up to an "acceptable" number of errors in the 
decision algorithm. Also this is the right way of approaching the problem from the 
parametric complexity methodology, i.e., to determine a fixed value for the parameter 
(a) for which the problem is tractable. 

However, to provide an algorithm that could be proved to be valid in the general case, 
we should take care of triangle-free graphs, so a routine based on tadpole processing 
seems to be incomplete. The most simple modification is to replace the tadpole test by 
a test based on the edge-addition/vertex-contraction of the end-vertices of a P 2 path 
x,y,z: 

(1) Select a P-2, add edge xz to G an test colorability for G + xz and a; 

(2) If G + xz is not 3-colorable then xz is a forced vertex contraction; 

(3) Contract xz and return the graph G/xz to continue working with it. 

It is very easy to verify that this algorithm is correct since a graph without an in- 
duced Pi subgraph is a complete graph. A pseudo-code is shown in Algorithm 6. 

As for the planar version, the general 3-coloring algorithm relies in the 
is_3colorable routine for supporting the search for a legal 3-coloring of the input 
graph G. However, the key-idea in this case is to build a complete vertex, i.e., a vertex 
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Algorithm 6 solve_non_edge(G, a) 

l: for all {x, y, z} = P 2 do 

2: if not is_3colorable(G + xz, a — 1) then 

3: G i— G/xz {edge-addition G + xz failed hence contract G/xz} 
4: return G 
5: end if 
6: end for 
7: return G 



Algorithm 7 general_3C0L(G, a) 

1: Q, H <— is_3colorable(if, a) 

2: if Q ^ co return Q, H 

3: else if try_greedy(_ff) return 1,H 

4: u «— the highest degree vertex of G 

5: while a; is a non-complete vertex, i.e., N(x) c V(G) — {x} do 
6: v <— a non-neighbor of x maximizing \N(x) U AT(y)| 
7: H<- G/uv 

8: Q,H ir- is_3colorable(iJ, a) 

9: if Q = 1 return 1, H 
10: else if Q = Q then G <- G + to 
11: else if try_greedy(_ff) return 1, H 
12: else G <— {undetermined case, assume 3-colorable} 
13: m ^— the highest degree vertex of G 
14: end while 

15: if N(x) is bipartite return 1, a three-coloring of G 
16: return oo, G 



Algorithm 8 BFS_3CDL(G) 

l: for a = to oo do 
2: Q,G = general_3CDL(G, a) 
3: if Q e {0, 1} return Q, G, a 
4: end for 



joined to all the remaining vertices of the graph, so that for testing 3-colorability at the 
end it suffices to test 2-colorability of its neighborhood. A pseudo-code of the general 
3-coloring algorithm (general_3C0L) is shown in Algorithm 7. 

Finally, an automated algorithm can be developed to eliminate the need of specifying 
the a parameter. Even more, this last routine (Algorithm 8) computes the function a(G) 
which is of theoretical interest. 

4. ANALYSIS OF THE ALGORITHM 

The average-case complexity, worst-case complexity and experimental performance of 
the algorithm are analyzed. The average-case is informally presented as a mean of es- 
tablishing the expected behavior over different kind of instances. The worst-case anal- 
ysis establishes the order (Big O) of the algorithm. Finally, the experimental analyses 
confront the algorithm with a series of graph distributions to study its performance 
and contrast it with the theoretical results. 
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Table I: Description of the a priori expected performance of the algorithm with respect 
to 3-colorability and graph density parameters 



KIND 



3-COLORABLE 



NOT 3-COLORABLE 



SPARSE High probability of a short running-time 
(d < d*) due to the existence of many legal color- 

ings. 



d* Harder instances. 

DENSE High probability of a short running-time 

(d > d*) due the existence of many Km subgraphs 

that prune the search, e.g., graphs tend to 

be uniquely colorable. 



High probability of a short running-time 
since almost all non-3-colorable graphs 
contain a K4 and hence the probabil- 
ity of obtaining a i-Q-free non-3-colorable 
graph decreases very fast when the aver- 
age degree falls below the phase transition 
threshold. 

Harder instances. 

High probability of a short running time 
due the existence of many small 3- 
uncolorability witnesses due to the aver- 
age degree, e.g. too many i^-subgraphs. 



* Phase transition threshold: average degree d ~ 4.69. 

4.1. Average-case (expected) complexity 

Table I shows the average case (expected) performance of the algorithm with respect 
to the type of the instance (Yes or No) and the density of the graph, i.e., above/below 
the phase transition threshold. 

In all the cases (except at the phase transition threshold) there is a high proba- 
bility of a short running time. A priori, it may look that the worst-case should occur 
on the sparse non-3-colorable graphs. This observation comes from the fact that for 
this class of graphs it is more complex to obtain a K4 by random edge additions and 
vertex contractions, nevertheless, some restrictions apply! Since the proportion of non- 
3-colorable graphs decreases fast below the threshold and almost all non-colorable 
graphs contain a K 4 then the probability of obtaining a K 4 -free non-3colorable graph 
below the threshold is very small. Even more, it is known that vertex 3-colorability of 
a graph with maximum vertex degree 3 can be determined in polynomial time [Stein- 
berg 1993]. Also every vertex of max degree 2 can be removed from the graph without 
affecting 3-colorability so non-3-colorable sparse graphs are very rare below c < 4 (this 
follows from the sharp-thresholds theory). 

Hence in almost all cases a short running time is expected. By short I mean signifi- 
cantly shorter than the worst-case exact upper bound. 



4.2. Worst-case complexity 

To determine the computational complexity (g) of the whole algorithm we will start by 
analyzing from the central routine: is3colorable. The routine admits a special param- 
eter a that control the level of recursive calls. In order to analyze its complexity, the 
recursion is fixed to a = and once obtained the complexity for a = the complexity 
for a > is established. 

The is3colorable routine depends on the complexity of two subroutines: 
solve_lf 112(G) and solve_T 3 i(G, a), which are called inside a while loop. However, for 
a = the solve_T 3 i(G, a) routine is not called. 

The routine solve_X 1 i 2 (G) has complexity of order at most 0(n 4 ) since it explores 
each K112 whose number grows as the number of combinations of four elements in the 
vertices of G. However, a deeper analysis show us that at each iteration of the while 
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loop, the algorithm performs a vertex contraction until there is no other Kn 2 . This 
means that the loop is bounded by the number of edges of the complement of G which 
has a quadratic 0(n 2 ) order in the number of vertices. Hence: 



g(solve_X 112 ) - (^J (15) 

= 0(n 2 ) (16) 

The complexity of is3colorable routine also depends on a while loop that calls se- 
quentially the solve _ifii 2 (G) routine. The while loop will iterate until the graph be- 
comes a triangle |V(G)| = 3 or earlier, when no other Kn 2 subgraph exits and so no 
vertex contraction is performed. Then, there are only three possibilities at each itera- 
tion of the while loop: 

(1) Decrease the number of elements of V. 

(2) End due to the number of elements in V has not decreased. 

(3) End due to G becomes a triangle, i.e., \V\ = 3. 

The worst-case scenario is the third case; when the loop ends due to |V| = 3, since 
this imply that the loop has been executed exactly n — 3 times (which is the maxi- 
mum possible). Thus the while loop has linear complexity 0(n) and hence the whole 
is_3colorable routine for a = has complexity: 



a = 0; (17) 

g(is_3colorable) = n. (g(solve__ft'ii 2 ))) ; (18) 

g(is_3colorable) = 0(n). (0(n 2 )) ; (19) 

g(is_3colorable) = 0(n 3 ) (20) 

Now, for the a > case, first consider a = 1. In this case, the routine solve_T 3 i is 
called. This routine has complexity 0(n 3 ) since it explores each T 3i tadpole subgraph 
(whose number grows as the number of combinations of four elements) but it works 
by contracting either {x,w} or {y,w} so it is only needed to look for unique x,y,w 
combinations of the vertices of tadpole subgraphs. 

Since the routine recursively calls the is_3colorable algorithm with a — 1 recursion- 
level in each iteration, the worst-case complexity of solve_T 3 i is: 



a= 1 (21) 

g(solve_T 31 )= Q.O(n 3 ) (22) 

g(solve_T 31 ) = 0(n 3 ).0(n 3 ) (23) 

g(solve_T 31 ) = (9(n 6 ), (24) 

and thus: 



a = 1 (25) 
g(is_3colorable) = 0(n 3 ).0(n 6 ) (26) 
= 0(n 9 ), (27) 
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and therefore, for a constant value: a = k, the complexity of the decision algorithm is: 

a= k (28) 
g(is_3colorable) = 0(n 3 ).0{n 6 ) k (29) 

Now, by lemma 2.5, the complexity of an algorithm that finds a 3-coloring is just one 
order higher: 

g(_3CDL) = 0(n).0(n 3 ).0(n 6 ) k (30) 

4.3. Experimental work 

The algorithm has been thoroughly evaluated trough experiments over significant 
samples taken from three different graph distributions: 

(1) Pseudo-random planar graphs. 

(2) Random 4-regular planar graphs. 

(3) Erdos-Renyi (connected) random graphs. 

Random planar graphs [Denise et al. 1996; Bodirsky et al. 2003] are more complex 
and its definition and sampling methods are more difficult to implement. Instead, we 
opted for a simpler approach to generate "pseudo random planar graphs". The pro- 
cedure is to generate a maximal planar graph and select edges from it uniformly at 
random (i.e. with equal probability) to create another graph called a pseudo random 
graph. For this purpose, we used the "Create Random Planar Graph" algorithm im- 
plementation used by Hochstattler and Schliep [2010] (Gato - the Graph Animation 
Toolbox 5 ) for the creation of such random planar graphs. Only one modification was 
included to avoid the obtention of too much graphs containing a i*Q subgraph. The 
idea consist in trying to generate a K±-fxee planar graph during 100 attempts return- 
ing the first encountered if 4 -free graph, otherwise return the 100 th generated graph. 

Random 4-regular planar graphs are also complex to generate. Here, the procedures 
described by Manca [1979], Lehel [1981] and Broersma et al. [1993] to generate all 
the 4-regular planar graphs have been used. In particular, theorem 2 of Broersma 
et al. [1993] is used to generate 4-regular planar graphs. However, there is no theory 
defining a random 4-regular planar graph, so an ad hoc distribution has been spec- 
ified trying to balance the proportion of yes/no instances. The distribution has been 
obtained assigning a probability to each graph transformation (see Broersma et al. 
1993): P((f> A ) = .80, P(<j) B ) = .05, P{cj> c ) = .10 and P{<j> F ) = .05. 

The Erdos and Renyi [1960] random graph, is a very well-known model that allows 
sampling from graphs uniformly at random and also for a fixed number of vertices 
or edges. We followed standard methods to sample from this distribution. The only 
modification is that the generation of a connected graph is assured by generating first 
a simple path passing through all the vertices and then adding the remaining random 
edges. 

The experiments are designed to study the behavior (not just the performance or 
running-time) of the proposed algorithm. For this purpose, there are curves evaluating 
its performance over a particular graph distribution and also there is an initial com- 
parative plot against the backtracking algorithm. The use of backtracking is restricted 
to the study of the algorithm's scalability since it is not possible to use backtracking 
consistently beyond the 100 vertex barrier due to its exponential growth. 



5 CATBox: http://schliep.org/CATBox, Gato: http://gato.sf.net. 
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Table II: Sample used in the scalability test. 



Sample property 


Value 


Sample type: 


Random planar graphs. 


Sample size: 


9000 graphs. 


Sample range: 


From 10 to 100 vertices, counting by 1. 


Group size: 


100 graphs per vertex number. 



All experiments where developed in the Python 6 programming language using its 
standard libraries and other libraries developed by the current author. Other software 
includes the planarity library 7 [Boyer and Myrvold 2004] as well as the above men- 
tioned Gato (the Graph Animation Toolbox) libraries. The experiments where realized 
on common personal computers and no parallelism was used. All time measurements 
are done in seconds using the python's time . clock () 8 function. 

4.3.1. Scaling factor compared against backtracking. The first part of the experimental 
analysis is a comparison between the scaling factor of the proposed algorithm against 
simple backtracking, in order to determine the differences in the behavior of both al- 
gorithms. 

This experiment consisted in generating uniformly random planar graph instances 
from 10 to 100 vertices (counting by 1 and generating 100 graphs for each number) 
and solving each instance with both algorithms. Table II shows the parameters of the 
sample used in the experiment. 

For each algorithm, the mean and maximum (max) running-times where recorded 
as well as some other relevant statistics. Times labeled as t\ correspond to the back- 
tracking algorithm while t 2 times correspond to the proposed parametric algorithm. 
Comparative plots are shown in Fig. 2 where there are six plots from (a) to (f). 

Fig. 2a shows the running-times as a function of the number of vertices for both 
kinds of instance types and for both algorithms. Fig. 2b shows the proportion of 3- 
colorable and non-3-colorable graphs over the total number of graphs per number of 
vertices. It can be seen that the distribution tends to be uniform. Figures 2c and 2d 
also show the running-times as a function of the number of vertices but discriminating 
by the instance types (yes-or-no) so that subtle differences can be observed. 

The general results indicate that there is a crossing point in which backtracking 
continues to grow exponentially while the proposed algorithm remains polynomial (cf. 
Fig. 2a at around 50 vertices), so a clear difference in the behavior of both algorithms is 
observed. This difference is clearer in the non-3-colorable instances (cf. Fig. 2c) where 
the maximum running-times of the parametric algorithm are lower in all the cases. 
Nevertheless, for the 3-colorable instances (cf. Fig. 2d) the difference starts to be clear 
around graphs on 50 vertices. 

Moreover, when running-times are compared as a function of the average degree, 
there is also a significant difference in the behavior of both algorithms. For non-3- 
colorable instances the parametric algorithm exhibits an almost constant performance 
(cf. Fig. 2e) and also a totally uncorrelated curve against backtracking, which on the 
contrary is very sensitive to the average degree. This difference, although in a slightly 
minor degree, can be also observed in the 3-colorable instances as shown in Fig. 2f. 



6 http : //www . python . org/ 

7 http : //code . google . com/p/planarity 

8 See http : / /docs .python. org/library/time . html. 
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Fig. 2: Runtime analysis of the backtracking (ii) vs. the proposed parametric algorithm 
(t 2 ) over random planar graphs between 10 and 100 vertices. 
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Table III: Sample 


used in the random planar graphs test. 


Sample property 


Value 


Sample type: 


Random planar graphs. 


Sample size: 


10000 graphs. 


Sample range: 


From 100 to 1000 vertices, counting by 100. 


Group size: 


100 graphs per vertex group. 



4.3.2. Pseudo-random planar graphs. This experiment consisted in generating uniformly 
random planar graph instances from 100 to 1000 vertices (counting by 100 and gen- 
erating 1000 graphs for each number) and solving each instance with the parametric 
algorithm. Table III shows the parameters of the sample used in the experiment. It is 
not possible to compare the results against backtracking due to its exponential-grow 
running-time. 

For each instance type (Yes/No), the mean and maximum (max) running-times where 
recorded, as well as some other relevant statistics. Comparative plots are shown in 
Figures 3 and 4. 

Fig. 3a shows the running-times as a function of the number of vertices for both 
kinds of instance types. Fig. 3b shows the proportion of 3-colorable and non-3-colorable 
graphs over the total number of graphs per number of vertices. It can be seen that the 
distribution is far from being uniform. Figures 4a and 4b also show the running-times 
as a function of the number of vertices but discriminating by the average degree. 

The results indicate that there is a very significant difference in running-times de- 
pending on the instance type (cf. Fig. 3a). This difference is expected since the proposed 
algorithm returns earlier when a 3-uncolorability witness is found. Even in the case 
when the average degree is considered the difference is high (cf. Figures 4a and 4b). 
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Table IV: Sample used in the random planar 4-regular graphs test. 



Sample property 


Value 


Sample type: 


Random planar 4-regular graphs. 


Sample 


P(4> A ) = .80, P(0 S ) = .05, P(<i> c ) = .10 and 


distribution" : 


P<$ F ) = .05 


Sample size: 


10000 graphs. 


Sample range: 


From 100 to 1000 vertices, counting by 100. 


Group size: 


1000 graphs per vertex group. 



"see Broersma et al. [1993] for the exact meaning of each graph transfor- 
mation operation, i.e., <f> A , 4>b> 4>c an d 4>f- 



4.3.3. Random planar 4-regular graphs. In this experiment, graphs are sampled from an 
ad hoc distribution over the 4-regular planar graphs. The sample consisted in gener- 
ating graph instances from 100 to 1000 vertices (counting by 100 and generating 1000 
graphs for each number) and solving each instance with the parametric algorithm. 
Table IV shows the parameters of the sample used in the experiment. 

For each instance type (Yes/No), the mean and maximum (max) running-times where 
recorded, as well as some other relevant statistics. Comparative plots are shown in 
Figures 3 and 4. 

Fig. 3 a shows the running-times as a function of the number of vertices for both 
kinds of instance types. Fig. 3b shows the proportion of 3-colorable and non-3-colorable 
graphs over the total number of graphs per number of vertices. It can be seen that the 
distribution is far from being uniform. Figures 4a and 4b also show the running-times 
as a function of the number of vertices but discriminating by the average degree. 

The results indicate that there is a very significant difference in running-times de- 
pending on the instance type (cf. Fig. 3a). This difference was expected since the pro- 
posed algorithm returns earlier when a 3-uncolorability witness is found. Even in the 
case when the average degree is considered, the observed difference is very high (cf. 
Figures 4a and 4b). 
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Running-times by type of instance (yes— or— no) 
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Fig. 5: Runtime analysis over random 4-regular planar graphs between 100 and 1000 
vertices 



Table V: Sample used in the random graphs test. 



Sample property 


Value 


Sample type: 


Erdos-Renyi random graphs. 


Sample size: 


10000 graphs. 


Sample range: 


100 vertices. 



4.3.4. Erdos-Renyi random graphs. In the last experiment, graphs are sampled from the 
well-known Erdos-Renyi random graphs distribution. The sample consisted in gener- 
ating graph instances of 100 vertices, generating in total 10000 graphs, and solving 
each instance with the parametric algorithm. Table V shows the parameters of the 
sample used in the experiment. For each instance type (Yes/No), the mean and max- 
imum (max) running-times where recorded, as well as some other relevant statistics. 
Comparative plots are shown in Fig. 6. 

Fig. 6a shows the quantity of 3-colorable and non-3-colorable graphs as a function 
of the average degree, i.e., a phase transition plot. In this case occurring at around 
d = 4.74. Fig. 6b shows the quantity of graphs corresponding to each a(G) value. As 
predicted by the theory, almost all graphs have a(G) < k (for some integer k) and the 
proportion of graphs decreases exponentially with a below the line of 1/2 Q+1 . These 
results confirm the established theoretical bounds. 

Figures 6c and 6d show the running-time as a function of the average degree. It can 
be observed that in the random graphs case, the difference in running-time is not as 
high as the difference observed in the planar graphs case. 

Also, there is a difference in the location of the harder instances for each kind of 
instance-type: the harder instances for the non-3-colorable case are located around 
an average degree of d « 5 while for 3-coloring instances are located slightly below an 
average degree of d ~ 4.8. Although the numbers seems to be very close the shape of the 
running-time curves are not. The shape of the running-time curve in 6d falls sharply 
after 5 while the shape of the running-time curve in 6c does not. This may indicate 
that there is a true difference in the location of the harder instances depending on its 
class (yes-or-no). 
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5. CONCLUSIONS AND FINAL DISCUSSION 

In this article, a parametric exact 3-coloring algorithm has been presented. This is (to 
the best of my knowledge) the first algorithm of its kind for the 3-coloring problem. 

The maximal complexity of the algorithm is controlled by the parameter (a) that 
bounds the recursion deep and determines its running-time: 0{n^°^). The algorithm 
relies on the efficient search of 3-uncolorability witnesses. Here, a formal definition 
of 3-uncolorability witness has been introduced. This is the central theoretical con- 
cept that allowed the development of the proposed algorithm. The definition of 3- 
uncolorability witness presented here is (to the best of my knowledge) the first one 
that is formally defined and the most naturally related to the 3-coloring problem. Also, 
a close relation between the Zykov [1952] theorem and the definition of 3-uncolorability 
witnesses defined here was shown. 

A very significant feature of 3-uncolorability witnesses is that it is possible to obtain 
them from small subgraphs of a particular graph, indeed, as small as four vertices 
(i.e. by finding a K± subgraph). Hence, an interesting theoretical analysis that should 
follow is to study the behavior of a(G) on 4-critical graphs since in that class there is 
no subgraph with chromatic number four and hence finding forced vertex contractions 
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should be harder (see Mizuno and Nishihara 2008 for a good initial development of 
this idea). Hence, a classification of 4-critical graphs based on a(G) could lead to very 
significant results. 

There is an interesting symmetry between colorings and uncolorability witnesses: 

— In order to show that a graph is 3-colorable it suffices to encounter just one legal 
coloring, nevertheless a legal coloring must assign a color to all the vertices of the 
graph without violating any constraint since it remains hard to determine if a par- 
tial coloring is extensible to all the vertices of the graph. 

— Instead, in order to show that a graph is not 3-colorable it is needed to verify 
that none of all possible 3-colorings is a legal one, nevertheless for finding a 3- 
uncolorability witness it suffices to encounter just one non-3-colorable subgraph (e.g. 
a 4-critical subgraph), i.e., a small graph. 

Thus, while for huge graphs just verifying a legal 3-coloring could become impossible 
in practice, it remains practical (at least in theory!) to determine 3-uncolorability even 
for huge graphs. 

Hence, in principle, finding uncolorability witnesses can be assumed to be at least 
of the same kind than finding colorings. Thus, there shouldn't be any problem in 
the development of 3-coloring algorithms based on searching for 3-uncolorability wit- 
nesses, that eventually reach the same level of sophistication and performance than 
its colorings-based peers. 

Also if the algorithm is used as a heuristic, for example to test whether a solution can 
be found quickly ("just by chance") with a lower (efficient) a, the algorithm will search 
for both 3-colorings and 3-uncolorability witnesses at the same time. This makes a 
clear difference to the use of backtracking, greedy-based or randomized 3-coloring al- 
gorithms. Besides, this feature is especially important since then it is not necessary to 
trust the correctness of the algorithm itself, nor the particular implementation used, 
to recognize that the provided solution is correct since the verification of the result 
can be done efficiently using just only the solution provided (a legal 3-coloring or a 
3-uncolorability witness). 

The developed theoretical analysis guarantees some good features of the proposed 
algorithm. The most important one, both for practical and theoretical purposes is that 
while the algorithm relies on the value of a to be able to find a witness, the probability 
that a(G) > k decreases at the rate P(a(G) > k) < l/2 fe+1 , e.g., for k = 19 there is less 
chance than one in a million of not obtaining a solution with the proposed polynomial 
algorithm (i.e. 0.999999 of probability of success), assuming that the input is a random 
graph. 

Thus, while certainly beyond some value of a the running-times would become pro- 
hibitive given the current state of the computing machinery, the developed algorithm 
scales polynomially and the probability of obtaining a solution (success) grows expo- 
nentially on a, i.e.: 



hence any step (i.e. any investment) in computing power technology will mean a huge 
(exponential) growing step in the class of tractable 3-coloring instances. Perhaps it 
could be the case that we can achieve at least a "technological tractability"? I mean 
a guaranteed number of instances such that almost all computational problems of 
practical interest could be solved for a(G) e [0, k], for some integer k. 




P(a(G) >k)< 1/2 



,fc+i 
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It should be also observed that increasing a as the result of technological progress 
implies that a =/= /(input). Does technological progress imply a polynomial algorithm 
for 3 -color ability? 

On the other hand, corollary 1.3 (10) establishes a new definition of computational 
complexity classes under the established parametric conditions: NP(a) and CoNP(a) 
are respectively the subsets of problems which are in NP or CoNP that given a 
(maximum) value of a are also in P. This, as proved here, implies that as a — » oo 
(NP U CoNP) \ P — s- 0, which could have some non-trivial interpretations, for instance, 
following the "physical process criterion for mathematical truth" [Aaronson 2003]: "we 
should expect a mathematical question to have a definite answer, if and only if we can 
phrase the question in terms of a physical process we can imagine" 9 . 

Would any physical constraint to a(G) (i.e. an upper-bound) imply P = NP since P ^ 
NP implies a(G) € [0, oo)? 

Thus, the following questions naturally appear: 

(1) Is there any graph construction mechanism that allows the generation of graphs 
with arbitrarily large a(G)7 And if so, what is the complexity of such algorithm? 

(2) Even more, there is any possible physical machine able to generate one of such 
arbitrarily -large-a(G) -graph instances in finite time? 

(3) In other words, can we assume that it is possible to encounter or to generate, in 
finite time, graphs with arbitrarily large a(G) without violating any physical law? 

In addition, since 3-colorability is NP-complete and to each graph corresponds a 
unique a(G) then a classification based on a(G) of all the NP-complete problem in- 
stances can be done by a reduction of each problem instance to a 3-coloring instance G 
such that a(G) — k for some k e N. 

However, can we define NP as follows? 



i.e., can then NP be defined as the infinite union of problems in P? 

Finally, even determining the infiniteness of a(G), is there -as with maximal degree 
four (A(G) < 4), a k e N such that determining 3-colorability over a class of graphs 
with a(G) <kis still NP-complete?, i.e., P = NP? 

In the maximal degree case, we know that 3-colorability restricted to A(G) < 4 is 
still NP-complete. Nevertheless, the problem is to determine if a polynomial algorithm 
exists or not. 

On the contrary, in the finite a(G) case, we know that 3-colorability restricted to 
a(G) < k is in P. Nevertheless, the problem is to determine whether it is NP-complete 
for some k e N. 
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A. REPRODUCIBILITY 

Working source-code of the algorithm and all the software libraries needed to properly 
use it and experimenting with it have been released and are available at the publishers 
web site. 

Furthermore, there is also a web-application that implements the algorithm inside 
the Google App Engine cloud computing framework. The user can visit the site and 
test the algorithm at the following url: 

— http : //graph-coloring. appspot . com 

The web coloring application just asks for a file where a graph is defined follow- 
ing the plain text version of the simple edge-list DIMACS standard format specifi- 
cation (http://mat.gsia.cmu.edu/COLOR/general/ccformat.ps), such as the .col files 
in http: //mat .gsia. emu. edu/COLOR/instances .html. An example of a graph file "gad- 
get.col" follows: 
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